Distillation of Transformer Models
Neural Turing Machines have external memory that they can read and write to. Attentional Interfaces allow RNNs to focus on parts of their input. A visual overview of neural attention, and the powerful extensions of neural networks being built on top of it. Distill is dedicated to clear explan... The article referenced in the video is as follows: